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By considering diffusion on De Bruijn graphs, we study in detaiis the dynamics of the histories in 
the Minority Game, a modei of competition between adaptative agents. Such graphs describe the 
structure of temporal evolution of M bits strings, each node standing for a given string, i.e. a history 
in the Minority Game. We show that the frequency of visit of each history is not given by 1/2 
in the limit of large M when the transition probabilities are biased. Consequently all quantities 
of the model do significantly depend on whether the histories are real, or uniformly and randomly 
sampled. We expose a self-consistent theory of the case of real histories, which turns out to be in 
very good agreement with numerical simulations. 



I. INTRODUCTION 

The Minority Game [QJ|] has been designed as the most drastic possible simplification of Arthur's El Farol's bar 
problem It is believed to capture some essential and general features of competition between adaptative agents, 
which is found for instance in financial markets. In this model, agents have to take each time step one of two decisions; 
they share a common piece of information p <G {0, • • • , P — 1} that encodes the state of the world, use it to make their 
choice, and those who happen to be in minority are rewarded. In its original formulation, the piece of information 
is the binary encoding of the M last winning choices, hence P — 2 M . Hence, the dynamics of p is coupled to the 
dynamics of agents. 

Cavagna [Q claimed that all quantities of the system "are completely independent from the memory of the agents" . 
This means that replacing the dynamics of p induced by agents by a random history p drawn at random at each time 
step, one finds the same results. While this statement has turned out to be wrong for many extensions of the MG [^| [|, 
it has been helpful as a first approximation for the analytical understanding of the standard MG: an exact solution 
for random histories has been found in the "thermodynamic" limit Interestingly, this solution shows that all 

quantities depend on the frequencies {p^ 1 } of visit of histories. The random history case is recovered if /Z 1 = l/P, but 
in the real dynamics of the MG the distribution p^ is determined by the behavior of agents (indeed modifying the 
behavior of agents may have strong effects on pP as shown in ref. It turns out, that the frequencies are not 

uniform for all parameters of the MG. 

In this paper we study quantitatively this problem. The first step is to characterize the properties of the dynamics of 
real histories, which amounts to study randomly biased diffusion on De Bruijn graphs. Depending on the asymmetry 
of the bias, we quantify the deviation 5p^ = p^ — l/P from the uniform distribution. Then we move to the MG and 
quantify the bias which agents induce on the dynamics of p in the asymmetric phase. Using a simple parameterization 
of p^ which is inferred from numerical data, we generalize the calculations of refs. This leads to a self-consistent 

equation between the asymmetry of the game and the diffusion bias, which we can solve. The results are in excellent 
agreement with numerical simulations and show a systematic deviation from the random history MG. Hence, our 
conclusion is that, even though the random history MG is qualitatively similar to the original MG, memory is 
actually not irrelevant, and one can quantify the difference between the two cases. 

II. DE BRUIJN GRAPHS 



Let us begin with the definition of some elementary concepts. A binary sequence p(t) of length m consists of m 
ordered elements {b(t — m), ■ ■ ■ ,b(t — 1)} where & is a letter belonging to the alphabet {0, 1}. p(t + 1) is obtained by 
adding b(t) to the right of p{t) and erasing b(t — m). Thus, for a given p(t), there are two possible p(t + 1), which we 
call "next neighbours". This updating rule defines the De Bruijn graph 0] of order m (see Fig. Ill for an example). 
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FIG. 1. De Bruijn graph of order 3 



Let G be the P x P adjacency matrix of the De Bruijn graph of order m. if we adopt the convention that its 
elements are indiced by the decimal value of the binary strings, that is, /x = 0, • • • , P — 1, 



Gn 



(1) 



where A%B stands for the remainder of the division of B by A and Sij is the Kronecker symbol. The adjacency 
matrix for in = 3 is: 

/ 1 1 \ 
1 1 



G 
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III. UNBIASED DIFFUSION 

The unbiased diffusion is defined as follows : a particle moves on the directed De Bruijn graph G and at each time 
step t, it jumps with equal probability to one of the next neighbours of the vertex it stands on at this time. Thus 
the transition probabilities matrix is Wo = G/2. In the long run, the fraction of time spent on vertex v is given by 
[{Wo) co ]om- It can be seen (see appendix) that 



l(W ) k 



^ 2*-l 



[2 k fi%P]+n,u- 



n=0 



In particular, (W^ 1 )n,v = -p for all k > 0, that is, all strings fj, are visited with the same frequency 
In order to have a intuitive feeling of those graphs, we write them for M = 3 : 
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(3) 



l/P. 



(4) 



IV. RANDOMLY BIASED DIFFUSION 



The perturbations are introduced by adding a term to the transition probabilities matrix W e 
quantifies the asymmetry and W\ contains the disorder £ 



W + eWi where e 



(5) 
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where the £ are iid from the pdf P(£) = 1/2 6(£ — 1) + 1/2 <5(£ + 1) and the (— l) u comes from the normalization 
of the perturbed probabilities. We are looking for the stationary transition probabilities, i.e., such that = 
linife^oo (W e ) k . It exists since W e is a bounded operator. Its formal series expansion in e is noted by = 



k>0 ' 



where 



is a matrix whose all coefficients are equal to 1/P (see above). The relationship — 



W£°W e provides the recurrence 



(6) 



Since W^Wq° — 0, we iterate m — 1 times this equation by replacing W£° with W£°Wo + WSl J _Wi in the r.h.s., 
yielding to 



w fe °° = w^wiv = [Wivy 



(7) 



where V = J^^Lq 1 (Wq) c - At this point, it is useful to remark that multiplying a matrix on the left by W§° is 
equivalent to averaging its columns : 



p-i 



0=0 



1 P_1 

p E A °- u 



average of the z^-th column of A 



(8) 



thus the matrices W%° consist of averages of columns of (WiV) . Therefore, (W^)^^ is the fc-th order correction to 
the frequency of vertex v, that will be called pY k s in the following. Note that (Pa,)}£ = for all k > 1. The square root 
of the second moment of pfa averaged over the disorder gives an indication of the typical value of pV k \ . In appendix 
O we obtain the approximation 



(l|P(A0l 



(1 - l/P) k 



(9) 



which is exact for the first order perturbation. Therefore p^s is of the same order as the unperturbed pmy thus it 
cannot be neglected. Fig || shows that the behavior predicted by Eq ([)]) is indeed correct for large P. 




FIG. 2. Squared norms of p(u) for k=0,. . . ,4 (circles, squares, diamonds, triangles up, full triangles) (average over 500 samples). 
They decrease as 1/P for large P. The continuous lines are exact theoretical predictions. 

Finally, one can estimate the second moment of p v . If one supposes that the perturbations at different orders are 
independent, one obtains 



p-i 



v = i E [«^ 2 >« - 



u=0 



1 
-pi 



1 



1 - (1 - 1/P)e 2 



1 



(10) 
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V. APPLICATION TO MG 



Let us first define the gamef]: MG consists of N agents trying to be at each time step in minority. Each agent has 
S strategies, or lookup tables a.; jS , s — 1, ■ ■ ■ , S , and dynamically assigns a score to each of them . At each time step 
t, the system's history p{t) is made available to all agents; the latter use their best strategy^ Si(t) take the decision 



a>i s .n\(t) — +1 or —1 and a market maker sums up all decisions into the aggregate quantity A(t) = X^i=i a i S (t)(t) 



Macroscopic quantities of interest include the temporal averages of A(t) conditional to /i(t) = /i, for all jj,, noted by 
(A 11 ). The MG undergoes a second order phase transition with symmetry breaking as the control parameter a = P/N 
is varied the system is in the symmetric phase {(A 11 ) = for all /i) if a < a c and it is in the asymmetric 

phase for a > a c . One convenient order parameter isn_ff = (A) 2 : it is equal to zero in the symmetric phase, and 
grows monotonically with a in the asymmetric phase (see Fig. ^) . One other relevant macroscopic quantity is 



the fluctuations a 2 = (A 2 ) which quantifies the performance of the agents |T^ , 0] 

Before doing any analytic calculations, it is worth looking at Figures |^ andQ which clearly show that Cavagna's 
assertion is right as long as the system is in the symmetric phase. Indeed, if (A^) = 0, the transition probabilities 
from /j, to its next neighbours are unbiased, that is e M = 0; therefore in the symmetric phase, where (A^) = for all /i, 
the frequencies of visit are uniform = l/P. Accordingly, numerical simulations show that these quantities collapse 
on the same curve. 
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FIG. 3. Comparison between the fluctuations of MG with uniformly sampled (squares) and real histories (full circles). In 
the symmetric phase, there are equal whereas they differ significantly in the asymmetric phase. Dashed and continuous lines 
are corresponding theoretical predictions; they overlap in the symmetric phase (M = 8, S — 2, 300P iterations, average over 
200 samples) 



^ee refs (Tl]|],|||l|] for more details 
2 the one with the highest score 

3 R — ^2 t p M _R M is the notation for the weighted average over the histories. 
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FIG. 4. Comparison between the available information of MG with uniformly sampled (squares) and real histories (circles). 
Dashed and continuous lines are corresponding theoretical predictions (M — 8, S — 2, 300P iterations, average over 200 
samples) 

As a increases, the critical point is crossed, and (A 11 ) ^ for some \x. The dynamics of the history is biased on 
all such histories and consequently all macroscopic quantities are significantly different: both a 2 /N and H/N are 
lower for real histories than for uniformly sampled histories. This can be understood by the facts that a 2 /N and H 
are increasing functions of a and that the biases on the De Bruijn graph of histories reduce the effective number of 
histories, that can be defined as 2~ log2 p : in other words, effective a of MG with real histories is smaller than that of 
MG with uniform histories. This explanation is indeed confirmed by Fig ||; this shows the fraction of frozen agents^] 
4> which is a decreasing function of a in the asymmetric phase. As expected from the above argument, </> of MG with 
real histories is larger than that of MG with uniformly sampled histories. 
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FIG. 5. Comparison between the fraction of frozen agents in MG with uniformly sampled (squares) and real histories 
(circles). In the symmetric phase, there are equal whereas they differ significantly in the asymmetric phase (M = 8, S = 2, 
300P iterations, average over 200 samples) 

The bias e M on a particular history can be estimated for large N: in this limit is a Gaussian variable with 
average (A^) and variance ((A^) 2 ) — (A^) 2 , leading to 



4 See they are agents that stop to be adaptative. 
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FIG. 6. e^ h of Eq Q vs real (M = 10, AT = 121, S = 2, 1000P iterations) 



Fig |6| confirms the validity of Eq. ( |11| ) . The figure also shows that e M are unevenly distributed and they are not 
equal even if the system is deep in the asymmetric phase (a ~ 8.5 in this figure). Indeed, as a function of fx, (A^) is 
a random variable with average and variance H 7 which is an increasing function of a. Since we studied diffusion of 
perturbed graphs with only one parameter e, we have to map all e M onto a scalar quantity, so that we define e as the 
non weighted averager] of e M over the histories. For large P, e can be approximated by 



e th = 2 



e 2ff 
dA , erf 



.4 



VW 2 



H) 



(12) 



Here both H and a 2 can be computed analytically with the method of refs. j|,[L2]j|] (see the appendix). However the 
solution depends on the distribution p M . In order to make Eq. ( |l2] ) a self-consistent equation for eth, we need to 
parameterize the distribution of p M by eth itself. 

We could not find ab initio the analytic form of the pdf of but Fig H shows that 



P(r) 



(A + l) 



A+l 



. r A e -(A+l)r 



r(A + i) 

is a very good approximation for the pdf of p = t/P. The parameter A is easily connected with e t h- 

1 



(r 2 ) - (r) 2 = 



1 + A 



= P'Ap' 



Hh 



1 



(13) 



(14) 



1 th 



where we used Eq. (10). This gives A ~ (1 — 2e 2 h )/e 2 h . Note that this approximation requires eth < 1/V2- 



5 This is clearly an important assumption, but the diffusion on De Bruijn graphs with one per site leads to a much greater 
complexity. As it appears on Fig H, andH, this assumption is not unrealistic. 
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FIG. 7. Distribution of the frequency of visit of the histories in the minority game. The continuous line is the best fit for a 
pdf given by Eq @ (M = 13, N = 801, 5 = 2, 400P iterations) 

This turns Eq. @ into an equation for eth, and the theory is self consistent. Figure g reports measured e and its 
approximation e t h- What clearly appears from this figure is that e is far from being negligible, and that Eth is a quite 
good approximation to e. 
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FIG. 8. e versus a = P/N (M = 8, S = 2, 300P iterations, average over 200 samples). The straight line is e t h, the theoretical 
prediction of the self consistent theory. 



We can also check the validity of Eq. (10) against the self-consistent theory. Fig ^| shows that Eq. ( |io| ) is in good 
agreement with numerical simulations as long as all histories are visited. Moreover the approximation eth for e leads 
to qualitatively similar results, but underestimates Ap 2 because eth < e (see figure 0). 
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FIG. 9. Inhomogeneity of the frequency of histories Ap 2 versus a — P/N from: numerical simulations (full circles), Eq 
( p"o[ ) with e from numerical simulations (void squares) and Eq ( ^p[ ) with e t h (continuous line); inset: average number of visited 
histories versus a; (M — 8, S = 2, 300P iterations, average over 200 samples). 

The self-consistent replica calculation for the Minority Game of refs. J^,|l2p|] with the ansatz p — t/P and r given 
by the pdf (|lj) is discussed in the appendix. Fig || and f| indicate that analytic predictions are well supported by 
numerical simulations. 

In the asymmetric phase, which is arguably the most relevant and interesting in the MG ||] , all quantities of MG 
change significantly if one replaces real histories with random uniform histories. A dependence on the frequencies 
p^ does not necessarily imply the relevance of the detailed dynamics of the histories. If the histories p, where drawn 
randomly from the "correct" distribution p^, the results would be the same (actually it suffices to know the pdf of p^). 
The problem is that the distribution p^ depends on the asymmetry {A^), which in turn depend on the microscopic 
constitution of all agents {Tl]| . In other words, p^ is a self-consistently determined quantity and hence it is only known 
a posteriori. 

VI. CONCLUSION 

We have shown that the dynamics of histories cannot be considered as irrelevant. Indeed, even for the canonical MG, 
it is relevant and cannot be replaced by randomly drawn histories. In addition, for many extensions and variations of 
the MG, the dynamics of histories is not only relevant, but crucial. 

We acknowledge fruitful discussions with Philippe Flajolet and Paolo De Los Rios. This work has been partially 
supported by the Swiss National Science Foundation under Grant Nr 20-46918.98. 

APPENDIX A: 

Let us prove by induction that 

2 k -l 

(Wo W = 7^7 J! S[2 k lJ.%P]+n,v ( A1 ) 
n=0 

It is sufficient to calculate explicitly (Wq from (Wq -1 )^,, 

p-i 

r=0 

= X! { 5 [2 k - 1 ([2^7oP])%P]+nM + S [2*=-^([2^%P]+l)%P]+n,u} 
n=0 

^ 2 k -l 

= X! ^[2 fc M%P]+«^ ( A2 ) 

since A(B%P)%P = AB%P and (2 k p + 2 k - 1 )%P = [2 k p%P] + 2 k ~ l if P = 2 M and k<m-l. 

APPENDIX B: 

In order to simplify the notations, we define 

2 C -1 

(A C ) Alj „ = 2J n%P]+n,v - <5[2 C +! n%P]+n+2" ,v (Bl) 

n=0 

This matrix is such that 
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1 if 2 C+1 ^%P < v < [2 c+1 fi%P] + 2 C } 
{X c )^ v = { 1 if [2 c+1 n%P] + 2 c <u< 2 c+1 (/i + 1)%P (B2) 
else 



With this formalism, one can write W±V as 

m—i 



W-yEi^V (B3) 



2^2 

c=0 

Let us calculate the perturbation at order 1: one has to compute ||/3(i)|| 2 in order to have an estimation of the 
typical value of a generic p^y. since the £ are uncorrected and Ylv=o ^{X 4 ) ^ v = 2 c+1 S c ,d , 

(IIPWII 2 ). = ^ E E^^ = (B4) 

^=0 c=0 

The next orders of perturbation are much harder to handle. However, for large P, one can approximate them by 
supposing that 

<l|P( fc )H 2 >« ~ (1 - lAP)<IIP( fc -i)H 2 }« = (1 ~p P)fe - (B5) 



Consequently, ~ (1 — l/P) fe / 2 -p ~ -p at leading order 



APPENDIX C: 



Since agents actually minimize H/N, one can consider this quantity as a Hamiltonian and find its ground state. 
This is possible by methods of statistical physics such as replica tric k [pT|Jl^ ]. The generalization of the calculus of 
refs |7[|l^|| to p M = t^/P drawn from the pdf given by Eq (|l^) and (|12|) is straightforward; the free energy reads in 
the thermodynamic limit 



F(0,Q,q,R, 



= (- 

\2(3 



log[l + xr] 



1+9 



7+X 



^-{RQ - rq) - i(log J dse~^ s2 ~^ z % 



(CI) 



where x = P(Q ~ l)/ a an d C = ~~ y/o/r (3(R — r). Next, the (3 — > oo limit is taken while keeping finite x an d C- O ne 
obtains 



H = 



1 + Q 



1 



X 



1 



X? 



and 



r 2 = H 



1-Q 



where Q and x take their saddle point values, given by the solution of 

Q(C) = i - 



0(0 = - 

a 



erf(C/V2) 



-i 2 



' - + X' r 



xC 

erf(C/%/2) 
a 



1 



(C2) 

(C3) 

(C4) 
(C5) 
(C6) 



Eqs (C5) and (C6), together with Eq (12), form a closed set of equations that has to be solved numerically. Note that 
as in the random histories case, x becomes infinite at the critical point, where a c = erf(£/v2). 
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